Purity estimation with separable measurements 
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Given a large number N of copies of a qubit state of which we wish to estimate its purity, we 
prove that separable-measurement protocols can be as efficient as the optimal joint-measurement 
one if classical communication is used. This shows that the optimal estimation of the entanglement 
of a two-qubit state can also be achieved asymptotically with fully separable measurements. Thus, 
quantum memories provide no advantage in this situation. The relationship between our global 
Bayesian approach and the quantum Cramer-Rao bound is discussed. 
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The ultimate goal of quantum state estimation is to 
determine the value of the parameters that fully char- 
acterize a given unknown quantum state. However, in 
practical applications, a partial characterization is often 
all one needs. Thus, e.g., knowing the purity of a qubit 
state or the degree of entanglement of a bipartite state 
may be sufficient to determine whether it can perform 
some particular task — See Ref. Q for recent experi- 
mental progress on estimating the degree of polarization 
(the purity) of light beams. This Letter concerns this 
type of situation. 

To be more specific, assume we are given N identical 
copies of an unknown qubit mixed state p(r), so that the 
state of the total system is p N (f) = [p(r)] ■ The set 
of all such density matrices {p(r)} can be mapped into 
the Bloch sphere B — {f : r = \r\ < 1} through the 
relation p{r) — (1 + r • c?)/2, where a = (a x , <r y , a z ) is 
a vector made out of the three standard Pauli matrices. 
Our aim is to estimate the purity, r, as accurately as 
possible by performing suitable measurements on the N 
copies, i.e., on p N (f). This problem can also be viewed 
as the parameter estimation of a depolarizing channel 3] 
when it is fed with N identical states. 

Estimation protocols are broadly divided into two 
classes depending on the type of measurements they use: 
joint and separable. The former treats the system of N 
qubits as a whole, allowing for the most general mea- 
surements, and leads to the most accurate estimates or, 
equivalently, to the largest fidelity (properly defined be- 
low). The latter, treats each copy separately but classical 
communication can be used in the measurement process. 
This class is particularly important because it is feasi- 
ble with current technology and it offers an economy of 
resources. In this Letter we show that for a sufficiently 
large N, separable measurement protocols for purity esti- 
mation can attain the optimal joint-measurement fidelity 
bound. The power of separable measurement protocols 
in achieving optimal performance has also been demon- 
strated in other contexts 

It has been shown |j| that given N copies of a bipartite 
qubit pure state, \^i)ab, the optimal protocol for mea- 



suring its entanglement consists in estimating the purity 
of p(r) = ti b(\^)ab(^\) : where trg is the partial trace 
over the Hilbert space of party B (see [l.i for related work 
on bipartite mixed states). We thus show that for large 
N this entanglement can be optimally estimated by per- 
forming just separable measurements on one party (party 
A in this discussion) of each of the N copies of \^)ab- 

In this Letter, special attention is paid to the asymp- 
totic regime, when N is large. There are several reasons 
for this. First, in this limit, formulas greatly simplify 
and usually reveal important features of the estimation 
protocol. Second, the asymptotic theory of quantum 
statistical inference, which has become in recent years 
a very active field in mathematical statistics Q, deals 
with problems such as the one at hand. Our results give 
support to some quantum statistical methods for which 
only heuristic proofs exist; e.g., the applicability of the 
integrated quantum Cramer-Rao bound in the Bayesian 
approach y|. 

In the first part of this Letter we state some important 
results concerning the optimal joint estimation protocols 
and give the corresponding fidelity bounds. In addition 
to the general case of states in B, which was partially ad- 
dressed in , we also consider the situation when the un- 
known state is constrained to lie on the equatorial plane 
£ of the Bloch sphere B. In the second part, we discuss 
separable measurement protocols, we prove that they sat- 
urate the joint-measurement bound asymptotically and 
we state our conclusions. 

Mathematically, the problem of estimating the purity 
of p(r) can be formulated within the Bayesian frame- 
work as follows (see 9] for a large deviations approach, 
which is only meaningful in the asymptotic regime). Let 
IZq — {R x } be the set of estimates of r, each of them 
based on a particular outcome x of some generalized mea- 
surement, O, over p N (r). Such measurement is charac- 
terized by a Positive Operator Valued Measure (POVM), 
namely, by a set of positive operators O — {O x } that 
satisfy J2 X O x = 1. A separable measurement is a par- 
ticularly interesting instance of a POVM for which each 
O x is a tensor product of N individual operators (usually 
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projectors) each one of them acting on p(r). 

Next, a figure of merit, f(r,R x ), is introduced as a 
quantitative way of expressing the quality of the purity 
estimation. Throughout this Letter we use 



f(r,R x ) = rR x + ^l-r 2 J\ 
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which we call fidelity for short. Its values are in the 
range [0,1], where unity corresponds to perfect deter- 
mination. This fidelity has a natural interpretation: in 
Uhlmann's geometric representation of the set of density 
matrices as the hemisphere (l/2)§ 3 C R 4 , the function 
D(r,R x ) = (1/2) arccos/(r, R x ) is the geodesic (Bures) 
distance 0] between two sets (two parallel 2-dimensional 
spheres) characterized by the purities r and R x respec- 
tively. 

The optimal protocol is obtained by maximizing 
F{0,1lo) = E X J dpf{r,R x )tY[p N (r)O x ], where dp is 
the prior probability distribution of p(r) , and we identify 
the trace as the probability of obtaining the outcome x 
given that the state we measure upon is p N {r). Thus, 
F is the average fidelity. The maximization is over the 
estimator (guessed purity) TZq and the POVM O, 



F max = max <^ maxF(0,ft o ) 
o 1 TZo 



(2) 



In this formulation, we need to provide a prior prob- 
ability distribution (prior for short) dp, which encodes 
our initial knowledge about p(r). Here we assume to be 
completely ignorant of both n = f/r and r. Our lack of 
knowledge about the former is properly represented with 
the choice dp oc dfl (solid angle element), which states 
that a priori n is isotropically distributed on B. There- 
fore, we write 



dp = — w(r)dr; 
4tt 



dr w(r) 



1. 
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While there is wide agreement on this respect, the r- 
dependence of the prior is controversial and so far we 
will not stick to any particular choice. Nevertheless, 
it is worth keeping in mind that the hard sphere prior 
w(r) = 3r 2 shows up in the context of entanglement esti- 
mation [nj, whereas the Bures prior w(r) = (4/7r)r 2 (l — 
r 2-j-i/2 j g mos ^ na tural in connection with distinguisha- 
bility of density matrices |l2|, Il3l 1 1 ll ] . 

Rather than computing i|2|l. we here present the main 
results (details can be found in |l5j). 

(i) The optimal POVM is defined by the set of opera- 
tors {Ijo! = Ylm \j m > a )(j m ; Each Hj a projects over 
the invariant subspace corresponding to an irreducible 
representation j of SU (2) — the group of unitary transfor- 
mations U that acts naturally over B as p(r) — ► Up(f)W . 
Here the index a (a = 1, 2, . . . , rij) labels the different re- 
occurrences of j. All these rij equivalent representations 
j give an identical contribution to F. This result should 



not come as a surprise. The optimal purity estimate, and 
thus the fidelity, should only depend on invariant quan- 
tities (i.e., j and a), as the purity itself is rotationally 
invariant, and so is our choice for the prior. 

(ii) The optimal purity estimator can be written as 



R. , 



A^ + B 2 )- 1 ' 2 , where 



(Aj, Bj) =J dr w(r) (r, yjl-r 2 ) ^ p 



^-—771 -$r+m 



(4) 



m=—j 



and p r = (1 — r)/2, q r — 1 — p r . We can easily identify 
the sum in (0J as the probability tr[p (r)tj a ]. 
(Hi) The maximum fidelity is given by 



F a 



N 



2 j + 1 \- 

4r + J + 1 . 
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where the coefficient in front of the sum is n 
For large N, this can be computed to be 

= l-^L + (iV- 1 ). 



(6) 



One can also check that at leading order Rj — 2j/N+. . ., 
as one would intuitively expect. These asymptotic results 
hold for any prior w(r). 

It is also interesting to analyze the case where f is 
known to lie on the equatorial plane 8. With this 
information, the prior probability distribution becomes 
dp = (d(f>/2n)w(r)dr, where 4> is the polar angle of the 
spherical coordinates. The group of unitary transfor- 
mations on £ is now a [7(1) subgroup of SU(2) and, 
hence, the optimal POVM is given by the correspond- 
ing one-dimensional projectors over the [/(l)-invariant 
subspaces, {tj am = \jm; a)(jm; a\}. With this, one can 
work out the maximum fidelity. It turns out that asymp- 
totically F max is also given by © and the optimal guess 
is Rj m = 2j/N+. . . (independently of m). The same con- 
clusions also hold in the one-dimensional case of states 
known to lie on a diameter of B. Therefore, we see that 
the information about n becomes irrelevant in the asymp- 
totic limit. 

A word regarding quantum statistical inference is in 
order here. It is often argued that the quantum Cramer- 
Rao bound 0| can be integrated to provide an attain- 
able asymptotic lower bound for some averaged figures 
of merit, such as the fidelity 0J. Ours is a so-called one 
parameter problem for which the quantum Cramer-Rao 
bound takes the simple form Vari? > H^ 1 (f)/N, where 
Vari? = ((R x — (R x )) 2 } is the variance of the estima- 
tor R x , the average is over the outcomes x of a mea- 
surement, H(r) is the quantum information matrix |l7| . 
and R x is assumed to be unbiased: (R x ) = r - I n °ur case 
H(r) = (1— r 2 ) -1 , and the bound is attainable. This pro- 
vides in turn an attainable asymptotic upper bound for 
the fidelity (JTJ, since (f(r,R x )} « 1 - iH(r) Varii + . . .. 
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Assuming one can integrate this relations over the whole 
of B (including the region r w 1, where H{r) is singular), 
with a weight function given by the prior Q, one easily 
obtains Eq. ©. Unfortunately there are only heuris- 
tic arguments supporting this assumption, but so far no 
rigorous proof exists in the literature. 

We now abandon the joint protocols to dwell on 
separable-measurement strategies for the rest of the Let- 
ter. Here we focus on the asymptotic regime, but some 
brief comments concerning small N can be found in the 
conclusions. 

In previous work |18j , some of the authors showed that 
the maximum fidelity one can achieve in estimating both 
r and ft (full estimation of a qubit mixed state) assum- 
ing the Bures prior and using tomography behaves as 
F fun X = 1 - £N~ 3/4 + o{N- 3 / 4 ), where £ is a positive 
constant. The same behavior one should expect for our 
fidelity f max , since the effect of the purity estimation is 
dominant in F^fi*. This strange power law, somehow un- 
expected on statistical grounds, is caused by the behavior 
of w(r) in a small region rwl. Indeed, it is not difficult 
to convince oneself that if w(r) oc (1 — r 2 )~ A « 2(1— r)~ A 



for r»l, one should have 1 — F n 



2-1 



for 



< A < 1 (for A = 0, hard sphere prior, one should have 
logarithmic corrections). This differs drastically from © 
which, as stated above, holds for any such values of A. 
Would classical communication be enough to restore the 
right power law jV" 1 for 1— i^ max and, moreover, saturate 
the bound of the optimal joint-measurement protocol? 

On quantum statistical grounds, one should expect 
a positive answer to this question since the quantum 
Cramer-Rao bound is attained by a separable protocol 
consisting in performing the (von Neumann) measure- 
ments M. — {(1 ±n • a) /2} on each copy. Note, however, 
that M. depends on n, which is, of course, unknown a 
priori. This protocol can only make sense if we are ready 
to spend a fraction of the TV copies of p{r) to obtain an 
estimate of ft, use this classical information to design M. 
and, finally, perform this adapted measurement on the 
remaining copies. This protocol was successfully applied 
to pure states by Gill and Massar in |lj|]. We extend it 
to purity estimation below. 

Let us consider a family of priors of the form 



w(r) 



4 r(5/2-A) 

A r(i-A) 5 



''(1-r-) 
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which includes both the Bures (A = 1/2) and the hard 
sphere (A = 0) metrics. Despite of this particular r de- 
pendence, the final results apply to any prior whose be- 
havior near r = 1 is given by (JJJ. 

We now proceed a la Gill-Massar and consider the 
following one-step adaptive protocol: we take a fraction 
N a = N a (0 < a < 1) of the N copies of p(r) and we 
use them to estimate ft. Tomography along the three 
orthogonal axis x, y and z, together with a very elemen- 
tary estimation based on the relative frequencies of the 



outcomes |20| , enables us to estimate ft with an accuracy 
given by 



Ml ~i 

2 



(cos0 r ) 



3/1 1 



fj+o(N^), (8) 



where r is the angle between ft and its estimate. Here 
and below (■ • •) is not only the average over the outcomes 
of this tomography measurements, but also contains an 
integration over the prior angular distribution dQ/(4it) 
for fixed r. 

In a second step, we measure the projection of a along 
the estimated ft obtained in the previous step. We per- 
form this von Neumann measurement on each of the re- 
maining N — N = Ni copies of the state p(r). We esti- 
mate the purity to be R = 2N + /Nx — 1, where N±/Ni is 
the relative frequency of ±1 outcomes, and we drop the 
N + dependence of R to simplify the notation. 

Obviously, as a random variable and for large iVi, R is 
normally distributed as R ~ N(rc r , yl — r^^/s/Ni), 
where c r — cosO r . Hence, for large Nq and Ni it makes 
sense to expand f(r,R), Eq. QJ, around R = rc r , and 
thereafter, because of iJSJ, expand the resulting expres- 
sion around c r — 1. We obtain 



r 2 f(Q 2 r ) (€# 



1 



4/Vi 



(9) 



where F(r) is the average fidelity for fixed r, i.e., 
J drw{r)F{r) = F. In view of ©, (6*) - iV" 2 = N~ 2a . 
Hence, the two terms in parenthesis in © can only be 
dropped if a > 1/2. Provided w(r) vanishes as in Q 
with A < 0, we can integrate r in @ over the unit inter- 
val to obtain 



1 



1 



2JV(1 - iV"- 1 ) + ° (7V ^ 



(10) 



and we conclude that this protocol attains asymptotically 
the joint-measurement bound @. 

However, most of the physically interesting priors [ill 
HH ]. w(r), not only do not vanish as r — > 1, but often 
diverge like Q with < A < 1. In this case © cannot be 
integrated, as the last term does not lead to a convergent 
integral. This signals that the series expansion around 
c r = 1 leading to 10 is not legitimated in the whole of B. 

To fix the problem, we split B in two regions. A sphere 
of radius 1 — e, e > 0, which we call B 1 , and a spher- 
ical sheet of thickness e: B 11 — {f : 1 — e < r < 1}. 
The fidelity can thus be written as the sum of the cor- 
responding two contributions: F = F l + F 11 . While F 1 
can be obtained by simply integrating 10 over B 1 , where 
this expansion is valid, some care must be taken in the 
region B u . There, we proceed as follows. 

We compute the fidelity as if all the states in B u had 
the lowest possible purity (r = 1 — e) when the first-step 
tomography was performed. This leads to a lower bound 
for F u , because the lower the purity of a state the less 
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accurately ft can be determined [see Eq. JSJ], and hence, 
the worse its purity can be estimated in the second step. 
The trick, which amounts to replacing c r by Ci_ e , enables 
us to integrate r prior to performing the average (•••). 
A straightforward calculation leads to 

^>i-^-2 A - 2 fcA<eU 2 - A + ---, (ii) 

< A < 1, where k x = 2 2 ~ A r(| - A)r(| - A)r(A - 
2)/[7iT(l — A)]. Now, we can safely take the limit e — > 0. 
We see that by choosing max {1/2, 1/(2 — A)} < a < 1 
we ensure that the joint-measurement bound (Jfjjl is at- 
tained. It is worth emphasizing that the last term in (|llfl . 
which is completely missing in i|10[l , is actually the dom- 
inant contribution if a < 1/(2 — A). For A = we have 
F hard > 1 _ (i/2)JVf 1 - (3/8) A^ 1 (6?) log(6f) + ... and 
we again conclude that the protocol presented here at- 
tains the joint-measurement bound. 

At this point one may wonder if the conclusions above 
depend upon our particular choice of figure of merit. To 
get a grasp on this, it is worth using again the standard 
pointwise approach to quantum statistics. There, one is 
interested in the mean square error MSE R — ((R — r) 2 ) 
for fixed r, where now the average (• • •) is over the out- 
comes of all measurements for a fixed f. One can write 
MSEi? = Vari? + ((R) — r) 2 , where the second term is 
the bias. Using the same one-step adaptive protocol de- 
scribed above, we get that the mean square error after 
step two is MSEi? = [Nrfir)]- 1 + r 2 (9^)/4 + . . .. As 
above, the last term can be dropped if a > 1/2, and 
MSEi? = [NH(r)}- 1 + o[JV _1 ], saturating the quan- 
tum Cramer-Rao bound. This protocol is, therefore, also 
asymptotically optimal in the present context. 

In summary, though the absolute bounds for the av- 
erage fidelity involve joint measurements, these bounds 
can be obtained asymptotically with separable measure- 
ments. This requires classical communication among 
the sequential von Neumann measurements performed 
on each of the N individual copies of the state. This 
result, which has been speculated on quantum statisti- 
cal grounds, is here proved for the first time by a direct 
calculation. Since the purity is an optimal measure of 
the entanglement of a pure bipartite qubit state, we also 
obtain the additional result that this entanglement can 
be optimally estimated with separable measurements on 
just one of the parties. 

For finite (but otherwise arbitrary) N, finding the op- 
timal separable measurement protocol is an open prob- 
lem. Interestingly enough, a 'greedy' protocol designed 
to be optimal at each measurement step leads to an 
unacceptably poor estimation. Notice that in the one- 
step adaptive protocol described above, part of the copies 
were spent ('wasted' from a 'greedy' point of view) in es- 
timating n. We have seen that this strategy pays in the 
long run. However, the 'greedy' strategy optimizes mea- 
surements in the short run, which translates into measur- 



ing u along the same arbitrarily fixed axis. This yields a 
low value for the fidelity, which does not even converge 
to unity in the strict limit N — > oo. This counterin- 
tuitive behavior also appears in other contexts as, e.g., 
economics, biology or social sciences (see e.g. Hi]). 
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